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ABSTRACT 


Assuming  that  the  x-coordinates  of  inpact  points  of  rounds  fired  at  a  target  cen¬ 
tered  at  the  origin,  x  =  o,  are  the  sums  of  correlated  aim  errors  and  independent  bal¬ 
listic  errors,  we  show  how  the  likelihood  function  of  the  unknown  statistical  parameters, 
associated  with  a  sample  Xj,  X2,  ....  of  inpact  points  from  a  single  burst,  can  be 

quickly  and  simply  computed.  The  likelihood  function  is  used  here  as  the  basis  for  esti¬ 
mating  the  parameters,  and  the  results  of  our  estimates,  applied  to  a  large  number  of 
computer  simulations,  are  summarized. 
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ESTIMATION  OF  AIM-ERROR  CORRELATION,  AIMING  DISPERSION 
AND  BALLISTIC  DISPERSION 


INTRODUCTION 

The  importance  of  obtaining  efficient  estimates  of  the  statistical  parameters  describ¬ 
ing  a  rapid-fire  gun  system  is  well  recognized.  Such  estimates  are  necessary  in  research 
studies  of  future  systems  and  tactics,  and  also  for  the  design  of  closed -loop  "filters"  to 
effect  corrections  at  the  gun  site  in  actual  firings,  (references  (b)  and  (c)).  In  reference 
(a)  we  began  the  study  of  the  estimation  problem  by  investigating  what  could  be  achieved 
from  the  traditional  sample  covariances  of  a  time  series  Xj,  X2> . .  .x^  ,  and  we  showed 

that  for  high  correlations  (p  «  1)  in  the  aim  error,  the  aiming  dispersion  was 

virtually  impossible  to  estimate  from  a  single  burst.  Since  rapid-fire  guns  have  a  high  o, 
the  results  of  reference  (a)  demonstrate  the  need  for  numerous  independent  experiments 
to  estimate  the  parameters  for  any  such  gun  system. 

However,  the  information  provided  in  reference  (a)  does  not  necessarily  apply  to 
cases  with  moderate  p  ,  or  moderate  burst  lengths,  and  no  attempt  was  made  in  refer- 

2 

ence  (a)  to  provide  formulas  for  estimating  p  and  o .  .  In  this  paper,  we  show  how  to 

A 

calculate  the  likelihood  function  associated  with  a  sample  x^,  x^, . . .,  x^  .  This  function 

is  then  used  to  provide  Bayes  estimates  of  the  parameters,  as  well  as  their  second 
moments.  The  latter  quantities  permit  us  to  obtain  approximate  confidence  intervals 
for  the  parameters. 

(Familiarity  with  reference  (a)  is  not  required  in  reading  this  paper.) 

THE  MODEL  AND  THE  PROBLEM 

With  a  target  centered  at  the  origin  of  a  1 -dimensional  coordinate  system,  we  imagine 
a  burst  of  N  rounds  impacting  in  points  x^,x9,  .  ...x^  .  (For  a  2-dimensional  picture, 

tlie  x's  may  denote  the  azimuth  errors  and  equally  well,  the  elevation  errors,  treated 
separately  and  independently.)  The  model  consists  of  the  assumptions  that  each  x  is 

the  sum, 

x  =&  +  1 3  (1) 

n  n  n 

of  two  stationary  Gaussian  processes  {a^}  ,  {0^}  ,  independent  of  each  other,  in  which 
represents  the  aim  error,  means  zero,  and  covariances 

l!  U,"r,+k]=,,i5|k|  '  (2) 


and  {^3^}  represents  the  ballistic  error,  means  zero,  and  covariances 
H  ^n  ^n+k  -  =  aB  *o,  k  * 
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Thu  process  }  is  often  called  "white  noise.”  There  are  exactly  three  parameters: 

2  n  2 

CTa  ,  the  aiming  dispersion;  cr^  ,  the  ballistic  dispersion;  and  p  ,  the  round -to -round 
aim  correlation. 

The  problem  is  to  estimate  the  values  of  the  3  parameters,  and  to  obtain  confidence 
intervals  for  them  from  impact  data.  Equivalently,  from  a  Bayes  viewpoint,  we  want  the 
posterior  probability  density  in  the  parameter  space,  given  a  single  sample  Xj,  x2>  . . . ,  x^ 

of  impact  points  from  a  burst  of  N  rounds. 

According  to  this  model,  the  gun  is  unbiased,  E^)  =  0  for  each  n  .  The  possibility 

of  a  non-zero  bias  is  of  no  importance  in  the  design  of  a  gun  filter,  because  any  such 
possible  bias  will  be  removed  by  the  filter,  (reference  (b)).  Furthermore,  in  sampling 

experiments  on  the  firing  range,  the  data  x ^ ,  ...,x^  can  be  replaced  by  x ^  -  x,  x^  -  x, 

...,x^  -  x”,  where  x  is  the  arithmetic  mean.  The  only  parameters  of  importance  are 
2  2 

>  and  p. 

We  confine  ourselves  to  the  estimation  problem  based  on  a  single  burst  x^, .. .,  x^,  , 
since  there  is  no  problem  in  combining  the  results  of  independent  experiments.  If 
2  2 

Lv(ca,  crR,  p)  denotes  the  likelihood  function  of  the  vth_  independent  trial  (with  the 
2  2 

same  o A>  <rR  ,  p)  for  v=  l,2,...,r,  then,  of  course,  the  combined  likelihood  function 
is  simply  the  product 


(  2 

2  t  Jr  , 

(  2 

2  N 

\°A  ' 

■b  ■  9)  ‘  JT,  Lv  1 

^ffA  ' 

"e  '  p) 

THE  LIKELIHOOD  FUNCTION 

From  the  last  section,  and  equations  (1),  (2),  (3),  we  see  that  the  sample  vector 
(Xj,  x2>  . . .,  x^)  is  Gaussian,  with  mean  (0,0,  . .  .,0)  ,  and  covariances 


cov  (x .  ,  x  .  ) 
'  l  j 


2 

aA 


P 


i-D 


(4) 


If  C  =  i‘c..}  is  the  N  x  N  covariance  matrix,  c..  =  cov(x.,x.)  ,  then  the  density  funct 
c  nJ  ij  'ir 


ij 

for  (x. ,  . . . ,  x. .)  is 
1  \ 


ion 


(X1 . XlJ°A'  ®B' 


r 


j^det  Cj 


(2n) 


N/2 


exp 


t  „-l 

-x  C  x 


(S) 
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L 


where  x  is  the  Nxl  matrix  with  elements  x  ,...,x^;  xl  is  the  transpose  of  x; 

C  *  is  the  matrix  inverse  to  C  ;  and  det  C  is  the  determinant  of  C  .  The  likelihood 
function  is  precisely  the  quantity  in  equation  (5), 


p)  -  'T'(x  .  •  •  >  Xj^j 


P) 


(6) 


(or  any  constant  multiple  thereof),  regarded  as  a  function  of  the  (Greek -letter)  parameters. 
The  observations  x^,  .  ..,x^  are  fixed,  of  course,  for  any  one  experiment. 


Hut  the  form  of  <r>  in  equation  (5)  is  practically  impossible  to  evaluate,  even  on  a 

high-speed  computer.  The  matrix  C  is  easy  to  write,  equation  (4),  but  C  *  cannot  be 
written  in  terms  of  the  parameters,  and  inverting  large  matrices  on  a  computer  is  time- 
consuming  and  subject  to  gross  round-off  errors.  Therefore,  we  again  consider  the  proba 
bility  density  of  (x^,...,x^)  and  write,  suppressing  the  Greek  letters, 

r(x  t , . . . ,  xN)  =  (x  t)  cn  2  (x2  |  x  { )  t 3  (x;}  ( x  { ,  x2). . .  m^x^,  |  x  { .....  xN  _  , )  (7) 


where  each  v.  is  the  conditional  density  of  x.  given  x, ,...,  x.  .  .  Each  r.  is  a  Gaussit 
J  J  1  J-l  J 

density,  and  we  therefore  need  only  the  two  quantities  (for  each  j  =  1, 2,  . . . ,  N)  , 


sian 


Xj  E  Lxjlxl . X  j-l  J 


and 


a  j  =  E  |_(Xj-  Xj)2|x 


1,  ....  x .  i 

3  -> 


and  we  can  then  write 


.  xj)  =  - 

3  /2TT  •»  . 


i  r  i  a  o~ 

exp  |_-  — j  (Xj  -  x  )  j 

1  2°i 


From  equation  (6),  (7),  (10),  we  can  write 


(8) 


(10) 


T  ,  2  2  . 

L  (aA ,  <jg,  F)  = 


N 


j=l 


— -  exp  ' — — r- 
ffj  L2a2 


,  A  ^21 
(x  .  -x  ■ ) 

J  J  -J 


(11) 


-1/2  ' 

where  we  have  discarded  the  constant  multiples  (2tt)  ,  and  in  which  the  x  depend  on 

2  2  2  2 
X| . x._j  ,  and  cr^ ,  ,  p,  whereas  the  rr  depend  only  on  ,  cr^  ,  p! 


r 


(The  proof  of  tile  last  clause  can  be  carried  out  along  the  following  lines.  The  condi¬ 
tional  density  of  given  j  can  be  written 

^  j  •  •  •  •  ^j.j)  —  *  ■  •  »x  j  _  i )  » ^j) 

where  t  denotes  the  density  of  (x .,...,  x),  and  k(x x.  .)  is  the  normalizing 

r  1  J  1  J-1 

factor  obtainable  from  / 'YX  dx^  =  1  .  Now  t  lias  the  form  given  by  the  r.h.s.  of  equation 
(5),  with  N  replaced  by  j  .  The  x^  appears  only  in  the  exponent,  so  we  can  write 


^(Xjlx^  ....x,^)  =  k’(x1 . x.)  • 


e“l/2  Q 


where 


Q  (x^ ,  •  •  •  t  x  ^  )  A 


x  . 

J 


and  A  is  the  matrix  inverse  of  the  j -rowed  covariance  matrix  of  Xj,...,x  .  by  famil¬ 
iar  methods,  Q  can  be  written 

Q  1  °o  (X1 . *j-l>  *  Ajj  (xj-  xj>2 

so  that 

2  X  _l 

a.  -  a. .  , 

J  JJ 

where  A„  is  the  (j,j)  element  of  A  .  For  ear  purposes,  the  main  point  is  that  tr^  does 
not  depend  on  x ^ . x.  ^  .) 


The  interesting  consequence  of  the  last  remark  is  that 


K  C('j  ’  V  1  =  ^  ■ 


(12) 


so  that  a.  is  the  (unconditional)  mean  square  difference  between  x.  and  (x ^ ,  .... 

x.  ^  .  Now:  the  problem  of  calculating  Xj  and  in  our  context  is  a  special  case  of  the 

Kalman  filter  theory,  which  is  treated,  for  example,  in  reference  (c),  and  which  we  sum¬ 
marize  in  appendix  A,  for  completeness. 
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Calculating  the  Likelihood  Function 

For  a  fixed  sample  x^x^ . x^  and  arbitrary  o^,  cr^  ,  p,  we  require  L,  given 

in  equation  (1 1),  or  equivalently, 


li(crA,  a,},  p)  =  logo  L  . 


(13) 


Appendix  A  shows  how  the  cr  and  of  equation  (1 1)  can  be  calculated  recursively.  The 
essential  equations  are  (A - 17),  (A-18),  (A-20),  (A -21),  and  (A -22),  which  we  repeat  here, 


x,  =0I 

2  ,2,2  . 

°n  =  (aA  +  *B  "  Un-1 


(14) 


(15) 


P  (aA  -  un_x) 

Yn  =  - 2 - 

a  n 


(16) 


1  ?  ' 
U  =  P  ( P  -  Y  )  U  ,  +  P  s  Y 

n  n ;  n-1  M  CTA  Tn 


(17) 


n+1  =  {p  -  V  xn  +  Yn  xn 


(18) 


In  figure  1,  we  have  displayed  a  flow  diagram  for  computing  h(crA,  (7^,  p)  .  The 

2  \ 

letters  V,  H,  C,  U,  Y  stand  for  cr^  ,  h,  y^»  u^,  .  This  efficient  algorithm  for  h 

2 

cun  be  made  faster  by  not  calculating  log(V)  =  log(an)  for  eacli  index,  but  rather  by 

forming  the  products  1 1  crn  recursively  (guard  against  overflow!),  and  then  calculating 
log  (  -fj-  cr^  )  just  once1,  for  each  p)  . 

i 

Estimating  the  Parameters 

For  each  parameter  point  ( a A,  ,  p)  —  a  point  in  3-space  —  we  can  evaluate  h 

and  therefore  L  =  e*1  ,  from  equation  (13).  But  in  place  of  p  ,  which  we  expect  to  be  in 
the  interval  0  <  p  <  1  ,  we  prefer  to  use  the  variable  C  ,  where 

c  =  4-  ‘°e  T^f  •  <o> 


-5- 


Tliis  is  equivalent  to 
e2C-l 

P  =  —rr - =  tanh  (C)  (20 

eSl 

and  we  take  our  parameter  space  to  be  the  set  ol  all  points  (aA,  cr|},  0  in  R  for  which 
all  three  coordinates  are  ^  0  . 


FIG.  1:  ROUTINE  FOR  CALCULATING  h  =  log ,  L 


If  'Y'  (<ta,  an,  O  denotes  our  prior  probability  density,  then  after  the  sample 


(x  . . . ,  x^)  is  observed,  our  posterior  density  is 

$(aA,  aB,  C)=CcpL=Ccpeh 


where  C  is  the  constant  required  to  make  equation  (22)  true: 


0  *  e  d“A  d0B  dt  =  1  • 


j  cp  e  daA  doB  d £ 


We  take  for  our  estimates: 


»A  =  E(«A*  =  \  “A  ^  d<IA  dff B  dC  ■ 


SB  '  E(oB)  =  i  M  'B  ”  d“A  dff B  dC 


£  *  E  <«>  *  j  j  t  S  d-A  d«B 


and  then,  from  equation  (20), 


P  =  tanh  (£) 
def 


We  have  used  for  n  ,  the  constant  (uniform)  density  for  0  s  <ja  <;  D,  0  «  <;  D 

0  <  C<3  ,  where  D2  is  four  times  the  0th  sample  covariance,  i.e.,  we  put 

N  1  v 

D  -  2  •  x2 

N  zL  l 
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r  n  i 

We  note  that  E  —  xf  =  CT^  +  aj^  ,  and  our  D  will  he  on  the  order  of  2(cr^  -t  crjp  ~  . 

-  1 

The  interval  0  <;  £  *-3.0  corresponds  to  0  <  o  <0.9954  . 

With  these  assumptions  on  our  prior,  we  are  in  a  position  to  consider  the  evaluation 
of  integrals  of  the  form 

r-  a 

j  J  J  F  (ffA'aB'C)  (flA  a  B  :)  dflA  d0B  dC  • 

2 

We  are  principally  interested  in  9  such  F’s,  viz.,  F.  -  cr^;  =  rr^;  F,j  =  £;  1  ^  =  or“  ; 

Fj.  =  <7^  rr^ ;  etc.,  i.e.,  we  want  the  1st  and  2nd  moments  of  rr  ^ ,  a^,  C.  and  more 
important,  the  2nd  moments  about  the  means.  For  example,  with  F  =  <y  ^  f,  our 
integral  is,  using  equation  (21): 

3  D  D 

eh(“A-"B':)  d„A  d„B  dC 

o  o  o 

Hi  1  eh(0A-‘’B'cl  d„A  d„B  a; 

o  o  o 


To  calculate  all  the  required  integrals,  of  which  there  are  now  10,  with  the  10th  F 
being  F  =  1,  we  used  the  Monte  Carlo  procedure,  choosing  500  points  (rr^,  a^,  O 

uniformly  randomly  in  the  region  o  <  o\  s  D,  o  <  rr^  s  D,  o  <  C  <  3  ,  and  we  used  the 
approximation 


F.  v  da  .  da„  il "  - 
j  A  B 


for  j  =  l,2,...,9,  where  oOv  denotes  the  vtT  random  point  (a^,  a^,  £)  .  (Since  the 
h  were  on  the  order  of  h  =  -200,  the  sums  in  tlie  previous  equation  were  replaced  by 
sums  of  the  form 


£F  ( a\J  exp  (h^  -  h*), 

v 

where  h*  =  max  h  and,  of  course,  the  constant  factor  exp  (-h*)  cancels  in  numerator 

v 

and  denominator.) 
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EXPERIMENTAL  PROCEDURE 

We  chose  16  parameter  combinations,  viz.,  <7^  =6,4,2, 1,  with  p=  .98,  .90,  .80, 
.50  (4cr  's  times  4  p's),  with  au  =  2  throughout.  For  each  parameter  combination  we 
took  5  independent  samples  x^,...,x^  with  N  =  100  in  every  case.  So  we  collected 
80  independent  samples  in  all.  Each  sample  was  generated  by  the  computer,  programmed 
to  simulate  the  process  {x^}  according  to  the  model  described  earlier. 

The  computer  calculated  and  printed  out  the  estimates  E  (tr^),  E  (cr^),  E  (£)  (expec¬ 
tations  with  respect  to  our  posterior  density  in  the  parameter  space),  together  with  the 
second  moments  about  the  means,  as  described  in  the  last  section.  Tins  was  done  for 
each  of  the  80  independent  samples. 

Experimental  Results 

The  means  (T.,  (Tn,  C  (and  also  p  =  tanh  (^)),  and  the  standard  deviations 


in 


V  d^otj.n/dVj.Vd^O  that  resulted  from  the  80  samples  are  all  presented 
A  13 

appendix  13.  The  mixed  moments  such  as  E  and  the  covariances  were  of  less 

importance  and  are  not  given  here.  (The  covariances  indicated,  as  expected,  that  the 

/v  A  A 

estimates  rr A>  £  were  not  independent  of  each  other.) 

The  main  purpose  of  computing  the  second  central  moments  was  to  provide  a  method 
for  calculating  confidence  intervals  for  the  parameters  cr^,  cr^,  C.  Let  <7^  denote  the 

a,  2 

true  value,  tr^  its  estimate,  and  D  (a  the  second  central  moment  (with  respect  to 
the  posterior  density  associated  with  a  sample  x^,  . . . ,  x^)  . 


Define 


<(,A  ~  JA>‘ 

1,2<°a> 


(24) 


We  have  80  independent  samples  of  this  quantity  when  we  aggregate  all  the  rr^’s  and  p's 

We  calculated  these  quantities  -  or  rather  their  square  roots  -  and  grouped  them  in 
intervals,  and  compared  the  frequencies  with  those  obtainable  on  the  hypothesis  that  x A 

is  distributed  like  chi-square  with  1  degree  of  freedom.  The  comparison  indicated  that 

the  latter  hypothesis  is  a  good  assumption,  i.e.,  that 

value  of  the  standard  unit  normal  variate  (so  x^  is  chi-square  with  1  d.f.). 


x ,  is  distributed  like  the  absolute 
A 
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rile  table  below  show's  tile  observed  and  expected  frequencies 


Interval 

0  -  .5 
.5  -  1.0 
1.0  -  1 .5 

1.5-  2.0 

>  2.0 


Frequencies  of 


Observed  =  O 

28 

21 

18 

6 

7 


X. 
\  A 


,Sa'  °a 


VD  ‘"A1 

Expected  =  H 


30 

25 

14 

7 

4 


O  -  K 

-2 

-4 

4 


The  chi-square  test  applied  here  gives 

=  4.30  , 

which  is  small  for  four  degrees  of  freedom,  and  indicates  a  satisfactory  fit. 
We  did  exactly  the  same  sort  of  calculation  for 

x..  =  %  *  V2 

H  - 


and  for 


0‘  (Ojj) 


(C -cy 

1 


(25) 


i)  (0 


(2b) 


ami  found  excellent  agreement  with  the  hypothesis  that  x^  and  x  ^  are  also  ilist  ributed 
like  chi-square  with  one  degree  of  freedom. 

Using  the  division  points  0,  .5,  1.0,  1.5,  2.0,  «>  ,  as  before,  the  numbers  of 

observations  of  Vx^  in  the  5  intervals  were  35,  27,  11,  3,  4,  yielding  O-K  -  5,2,  -3,  -4,0 

The  chi-square  Q  turns  out  to  be  Q  -  3.4 1  (four  degrees  of  freedom).  1'he  correspond¬ 
ing  numbers  of  observations  of  in  the  5  intervals  were  24,  25,  12,  9,  5,  and  the 

corresponding  Q  was  1.13  . 
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As  a  result,  we  are  justified  in  saying  that  each  of  x^,  x^,  x^  is  like  chi-square 
with  1  d.f.  Accordingly,  if  \  denotes  p  -  percent  value  of  a  normal  deviate,  i.e., 


Too 


V2TT~ 

then  we  can  use  the  intervals 


e  dt 


0 A  ‘  ‘A1' 

<  X  /  2 

P  Vd 

<•»> 

A  | 

_ 

0  B  ‘  0  B 

<  X  12 
p  Vd 

(oB) 

A  . 

t _ 

c  -  c  1 

<  X  2 

P  V.rr 

(C) 

as  (100  -  p)  percent  confidence  intervals  (though  not  all  simultaneously). 

As  an  example,  take  p=  10  percent.  Then  X  =  1.64  .  Suppose  that  we  observe 

r-ij -  P 

(calculate)  £  -  1.26,  \D“(£)  =  .57  .  1'hen  we  can  say 

|  £  -  1.26  |  <  ( l . 64)( .  57) 
or 

.02  <  £<2.20 

at  the  90  percent  level.  Since  o  tanli  (£)  is  a  monotonic  function  of  £  ,  we  can  also 
say 

.31  <  o  <  .97 

at  the  90  percent  level.  If  we  follow  sucli  a  procedure,  then  in  a  long  series  of  such 
independent  experiments  and  assertions,  we  will  be  wrong  about  10  percent  of  the  time. 

CONCLUDING  REMARKS 

We  have  described  a  model  for  the  miss  distances  (plus  or  minus)  Xj,  . .  ,,x 

arising  from  a  single  burst  of  a  rapid-fire  gun.  This  model  contains  3  parameters: 

( 7y  cr^  ,  o  .  The  problem  is  to  estimate  these  3  parameters  from  one  or  more  samples 

(Xj, . . .,  x^),  though  we  concentrated  on  the  case  of  one  sample.  (There  is  no  problem 

in  combining  independent  samples,  because  the  required  likelihood  function  is  obtainable 
from  those  of  each  single  sample  by  multiplying  the  latter  together.) 
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We  recommended  a  Bayes  type  of  procedure,  in  which  we  used  a  uniform  prior  density 
over  a  certain  3 -dimensional  interval  in  the  parameter  space  of  cr^,  er^,  £  =  tanh  (p)  . 

I’he  posterior  density  is  then  the  product  of  the  likelihood  function  and  the  prior,  apart 
from  a  multiplicative  constant  factor.  For  our  estimates,  we  took  the  expectation  of 

and  cTj,  and  £,  relative  to  our  posterior  density.  This  procedure  is  precisely  the  optimal 

procedure  when  our  loss  function  is  the  mean  square  error.  More  precisely,  suppose  we 

concentrate  on  a.,  for  example,  and  somebody  proposes  a  rule  i!r(x ,,...,  x  )  foresti- 
a  ■  in 

mating  er.  ,  i.e.,  observe  (x,,...,x  ),  form  the  real  number  i!'(x.,...,x  )  and  take 
a  in  in 

this  to  be  our  estimate  of  a A  . 


^  A 

For  any  fixed  ,  £  ,  the  totality  of  all  possible  estimates  rr^  =  ’!'(x^,  ...jX^) 

--  (x^,  . . .,  x  )  varying  according  to  its  p.d.f.—  has  a  certain  p.d.f.  We  form  the 

a  2 

iliffercnces  -squared,  (o^  -  O.)  ,  and  their  expected  value: 


o(<7 £  i  y) 


N 


y[¥(xL . V-"A]2 


^  I (*x ^ ,  .  • . ,  x^  |  trA ,  cr^,  £ )dx ^ , . . . ,  dXj^j 


where  f  is  the  probability  density  of  (x^,...x 

this  function  p  ,  depending  on  the  parameters  and  on  ill  ,  is  the  loss  that  we  wish  to  be 
small.  Let  '!■  denote  the  rule  proposed  in  this  paper.  To  say  that  ilr  is  better  than 

is  not  yet  a  meaningful  statement,  for  this  statement,  according  to  our  assumption,  ought 
to  mean  that 

o(fr^,  o^,  £,  y)  <  p  ( crA ,  o'p,  £ ,  i^)  • 

But  for  which  values  of  (a^,  £)?  We  will  not  know,  in  practice,  what  the  parameters 

are--if  we  did,  there  would  be  no  problem.  One  possible  way  to  make  precise  the  state¬ 
ment  is  better  than  "  is  to  use  the  definition:  for  all  possible  values  of  (a^,  <7^,  £), 

we  have 


given  cr^,  ,  c.  We  assume  that 


o(aA,  aB,  £:  *')  <  o  (aA,  oR,  £;  tQ)  , 

and  for  some  (crA,  a.,  £)  the  inequality  is  strict. 

A  I) 

But  it  is  not  hard  to  prove  the  following  for  our  problem:  let  <r>(o^,  Cfy  £)  denote  any 

prior  probability  density  that  is  non-zero  in  every  region  of  3-space  where  rrA  >  0  , 

a,  >  0,  and  lei  t  be  the  function  for  which 
B  o 
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r 


J'J'J'  p  <  '  °B '  C  •'  <WC)  dV°BdC 


is  smallest.  Tliis  rule  ’]'  will  turn  out  to  he 

o 


*o(xl . xn’  = 


I'l'i  CA  L|V°B'C)  dV0BdC 


^(oAaBC)  do^d^dC 


Then  there  is  uo  other  ^  about  which  one  can  say,  "t  is  better  than  ’!’  In  more 

3  o 

technical  language,  'h  is  "admissible." 


In  our  situation,  we  took  a  prior  ^(cr^,  a^,  C)  tluil  was  zero  lor  C  < 0  or  (  >  3.00 
1/2 


rN 


and  for  a ^  >  2 


T  ^ 

V  1 


-  D  and  for  >  D  .  The  reason  was  that  we  knew  that  (  >  0 , 


at  least  in  our  simulations  and  very  likely  in  practice  (although  it  may  be  safer  to  take 
the  interval  -  .5  <  Z  <?  3.0  in  practice),  and  in  no  simulation,  of  which  we  have  run  about 

N  2  i  2  9 

200,  have  we  ever  observed  a  case  with  £  x.  <  j(cta  +  a  jp  •  So  die  latter  event  is  one 

1 


with  very  small  probability,  and  we  ignore  that  possibility  in  our  procedure. 


It  is,  consequently,  unlikely  that  anyone  will  produce  a  rule  •!'  that  is  (uniformly,  for 
all  o^,  cfy  0  better  tlian  our  rule  i!^  ,  and  of  course,  the  same  also  applies  to  the  other 

parameters  Ojj  and  Z  . 
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APPENDIX  A 


We  consider  the  stationary  Gaussian  process  x  j ,  X£,  . . .,  x^,  . . .  with  means  zero, 
and  covariances 


E[x.x.l  =  a2  ^  +  a2  6, 

1  j  A  B  13 


(A-l) 


2  _  2 


w  ith  cr  “y  a  Jj  ,  p  assumed  known.  The  problem  is  to  calculate  recursively 


x  =  E  x  lx,,...,  x  .  1 
n  n'  1  n-1 


(A-2) 


a2  =  Er(x  -  x  )2^ 
n  n  n 


(A-3) 


It  is  well  known  that  lor  Gaussian  variables  fx  },  the  x  of  (A-2),  is  a  linear 

n  n  '  ' 

combination  of  x^,  . . . ,  x  ^  ,  and  in  fact,  it  is  that  linear  combination  for  which 


Q(xn)  =  (xl .  Xn-1'  Xn)  An  1 

Vr 

\  n  / 


(A -4) 


is  a  minimum.  Here  A  is  the  matrix  inverse  to  the  covariance  matrix  of  x.,  ...,x  . 

n  I  n 

It  follows  that 


n-1  A  . 

xn  =  ‘  S.  1T~  xj 

i"  ri  n  J 


(A -5) 


where  (A  . . A  )  is  the  nth  row  of  A  .  But  (A-5)  is  not  very  useful,  since  we  don't 

nr  nn  —  n 

liave  simple  formulas  for  the  A  .  . 

1  nj 

Let  C..  =  cov(x.,x.),  and  form  the  sums 
ij  1  J 


y  G..A  .  =  5.  -  C.  A 

11  ni  m  in  nn 
J  =  l  '  J 


(A-b) 


l’herefore,  the  coefficients 


Anj/Ann  *n  sat*st'y  the  (n -  L )  equations 


n-1 

ft* 


ij 


(-£) 


=  C. 


m 


(A -7) 


for  i  =  L,  2,  . . n-1  .  So,  when  we  nut 
n-1 


t  =Y'  y.x.  , 
n  pv  J  J 


(A-») 


the  (y  ,  ...y  j)  vector  is  the  unique  solution  to  the  matrix  equation  equivalent  to  (A -7): 


n-1 


'In 


'n-1 ,  n 


(A-y) 


where  C  is  the  (n-l)-rowed  matrix  with  elements  C... 

U 


The  equation  (A -9)  can  be  solved  recursively  in  a  simple  manner.  Indeed,  let 
(y’1»...»Y^  y,  Y^)  be  the  corresponding  coefficients  when  n  is  replaced  by  n+1.  Then 

we  have  to  solve  the  equation 


We  observe  that 


lc 

"C, 

1 ,  n+1 

• 

• 

=  P 

1 » n 

• 

_Cn-l ,  n+1. 

• 

C  . 
n-1 ,  n. 

Y1 

■  - 

Cl,n+1 

m 

•  f 

• 

— 

• 

Ci 

Cn-1 . n+1 

c 

n ,  n+1 

»  •« 

(A- 11) 


A-2 


using  (A-l).  So  the  first  n-1  rows  of  (A-10)  can  be  written 


c 

Yi 

=  “Y 1 

?ln 

+  P 

Cln 

- 1 

H 

1 

-  c 

>- 

_ 1 

n 

i — 

o 

3 

1 

H* 

3 

1 _ 

- 1 

c 

rH 

i 

c 

u 

_ i 

and  when  this  is  compared  with  (A -9),  we  see  that 


-  « 

Yi 

Yi 

; 

=  (p  y ' ) 

• 

n 

• 

Yn-1 

_Yn-l_ 

(A -hi) 


and  this  shows  how  the  y!  are  obtainable  from  y.  for  is  n-1,  once  y'  is  known.  The 

i  1  i  J  '  n 


J 

last  row  of  (A-10)  is  now  used,  to  give 

y  i 


J 


(C  . . C  ) 

nl  n,n-l 


y 


n-1 


(p-y* )  +  y'C  =  C 

Tn  Tn  nn  n,n+l 


in  which  (A-13)  has  been  used.  This  equation  yields 


(A -14) 


n-1 


y  = 
n 


,  -  oY  C  ,y. 
n,n+l  nj  j 

n-I 

C  -  V  C  .y. 
nn  nj  j 


It  is  convenient  here  to  introduce  the  quantities 
n-1 

u  .  =  T'  c  .y.  . 

n-1  nj  j 


(A- IS) 


A -3 


(A- 16) 


Then  (A-15)  becomes 


,  °*A  '  pVl 

1 


Yn  = 


(T.  +  (Tn  -  U  , 

A  B  n-L 


( A  - 17 ) 


making  use  of  (A-l).  Finally,  replacing  n  by  n+1  in  (A -16)  yields 


u  =  £c  .y! 
n  pi  n+l»J  J 


n-l 


=  0 


T  C  Y. 

j=l  n>J  J 


(p  'vii)+  (",Avn 


where  we  have  used  (A -11),  (A-13),  and  (A-l).  With  (A-16),  we  now  can  write 


2 ,, , 


un“  0(0 ’vn>  Vl  +  (®Ayi  ' 


(A-l  s) 


The  recursive  solution  for  the  y's  is  now  complete.  But  the  y's  themselves  are  not  what 
we  want;  it  is  the  x's  ,  and  the  y's  are  merely  a  means  to  an  end. 

Suppose  then  that  we  have  obtained  x^  ,  and  u  ^  .  From  (A-8),  with  n+1  in  place 
of  n  ,  we  get 


n-l 

x  ,  .  =  E  v!  x.  +  y'  x 
n+l  *r*  jj  n  n 
j  =  l  J  J 

We  use  (A-13)  now  to  write 

n-l 

A  c ' 

X  =2.  v.  x .  (  D  -  Y'  )  +  Y  X 

n+1  r-1,  jj  iv  n  n 

J=l  J 


(A -Id) 


i .  e . , 


A  -  ,  X  N  1 

x  ,  .  =  \  o  -  Y  )  x  +  v'x 
n+l  n  n  n  n 


(A -20) 


This  equation  permits  the  recursive  calculation  of  x^  when  combined  with  (A- 17)  and 
(A -18).  The  starting  values  are 


x,=0j 
u  =  O1 

O 


(A -21) 


A -4 


L 


Finally,  the  a  of  (A -3)  are  obtainable  from 
n 


<r2  =  E(x2 )  -  E(x2)  =  ct2  +  r2  -  (y. . v  . ) 

n  '  n'  '  ir  A  B  1  n-1 


C 


in 


c 


II  - 1 ,  II 


i.e., 


9  9  9 

=  (T  .  +  n~  -  u  . 
n  A  B  n-1 


(A -22) 


I'he  only  equations  required  for  the  actual  computations  are  (A-17),  (A-18),  (A-2U), 
(A -21),  and  (A -22). 
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AFPliNDIX  B 


The  following  7  tables  display  the  estimates  that  resulted  from  80  independent  samples 
of  time  series  x^,  •••*x]00  ^m^ss  distances  in  a  burst  of  100  rounds),  simulated  by  the 

computer.  The  heading  of  each  table  defines  the  quantity  whose  values  are  tabulated  in  the 
body  of  the  table.  The  4  numbers  on  the  left  and  the  4  at  the  top  represent,  respectively, 
the  true  value  of  p  (or  £)  and  .  The  value  of  was  taken  to  be  2.0  throughout. 

For  each  p  there  are  5  numbers  presented,  which  correspond  to  the  5  independent 

samples  used  for  each  parameter  combination.  See  the  main  text  for  additional  details. 

As  an  example,  the  estimates  for  viz.,  E(crB),  for  p=.80  and  <J^  =  4  were 
1.4,  1.8,  1.8,  2.3,  2.3. 

The  parameters  p  and  £  are  related  to  each  other  by  the  equation 
o  =  tanh(£). 

We  used  £  in  the  computer  calculations. 


B-l 


FABLE  B-l 


[?  Fff  1 

Ll(7Aj 


"A 

p 

6 

_  .  4.  .  J 

2 

] 

.98 

4.4 

7.0 

2.  8 

2.  2 

8.6 

7.  2 

3.4 

1.6 

10.  7 

4.  8 

1.  8 

1.  0 

3.9 

5.  0 

2.  1 

1.  1 

8.7 

6.  3 

1.7 

1.8 

.90 

10.  7 

3.  8 

2.6 

1.6 

4.  8 

5.  •' 

2.  1 

1.  2 

4.  4 

7.  o 

1.9 

0.9 

7.  1 

4.7 

2.  1 

1.  4 

5.  1 

5.  0 

2.0 

1.  7 

.  80 

5.  4 

4.  9 

2.  2 

1.  6 

8.  8 

3.  8 

3.  4 

0.9 

7.9 

6.  5 

1.  8 

1.  1 

4.9 

5.  2 

2.  3 

0.  8 

7.  0 

7.  2 

3.4 

2.4 

.  50 

5.  5 

4.  0 

2.2 

2.  4 

5.  3 

3.  8 

1.  8 

1.  1 

5.  0 

3.9 

2.  0 

1.  6 

4.  7 

4.  1 

1.  2 

0.  6 

_ 1 

o.  4 

4.  3 

1.  9 

0.  7 

B-2 


TABLE  B-V 


17 


[d2(oa)1*' 


% 


"A 

p 

6 
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